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Abstract 

We describe a new sampling-based method to determine cuts in an undirected graph. For a graph 
(V, E), its cycle space is the family of all subsets of E that have even degree at each vertex. We prove 
that with high probability, sampling the cycle space identifies the cuts of a graph. This leads to simple 
new linear-time sequential algorithms for finding all cut edges and cut pairs (a set of 2 edges that form 
a cut) of a graph. 

In the model of distributed computing in a graph G = (V,E) with 0(log |V|)-bit messages, our 
approach yields faster algorithms for several problems. The diameter of G is denoted by T>, and the 
maximum degree by A. We obtain simple 0(X>)-time distributed algorithms to find all cut edges, 2- 
cdgc-connected components, and cut pairs, matching or improving upon previous time bounds. Under 
natural conditions these new algorithms are universally optimal — i.e. a f2(D)-time lower bound holds 
on every graph. We obtain a 0(T> + A/ log | V|)-time distributed algorithm for finding cut vertices; 
this is faster than the best previous algorithm when A, T> — 0(->J\V\). A simple extension of our 
work yields the first distributed algorithm with sub-linear time for 3-edge-connected components. The 
basic distributed algorithms are Monte Carlo, but they can be made Las Vegas without increasing the 
asymptotic complexity. 

In the model of parallel computing on the EREW PRAM our approach yields a simple algorithm with 
optimal time complexity 0(\ogV) for finding cut pairs and 3-edge-connected components. 

1 Introduction 

Let G = (V, E) be a connected undirected graph. A part of G is said to be a cut if, after deleting it from G, 
the remaining graph is disconnected. We use the following terminology: 

• A cut vertex is a vertex v such that {v} is a cut. 

• A cut edge is an edge e such that {e} is a cut (i.e., a bridge). 

• A cut pair is a cut consisting of two edges e, / such that neither e nor / is a cut edge. 

For brevity we call all of these objects small cuts. In a network (e.g., for communication or transportation), 
the small cuts are relevant because they represent the critical points where local failures can cause global 
disruption. Our primary motivation is to efficiently find all small cuts of an undirected graph. We study 
this problem in the sequential, distributed, and parallel models of computation. 

The fundamentally new idea in this paper is to identify cuts by sampling the cycle space. For a graph 
(V, E) we say that cf> C E is a binary circulation if every vertex has even degree in (V,4>); the cycle space 
of graph (V, E) is the set of all its binary circulations. For S C V, let 5(S) denote the edges with exactly 
one end in S. An induced edge cut is a set of the form 5(S) for some S; cut edges and cut pairs are induced 

*A preliminary version of this work appeared as "Fast Distributed Computation of Cuts via Random Circulations" in 
Proc. 35th ICALP [PriOS]. 
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edge cuts 1 . The family of all induced edge cuts is called the cut space of a graph. The cycle space and 
cut space are orthogonally complementary vector subspaces of Zf (see Section 2), which implies that the 
intersection of any binary circulation and induced edge cut is of even size. At a high level, our algorithms 
depend on a probabilistic converse (Proposition 2.5): if F C E is not an induced edge cut, the number of 
edges of F intersecting a uniformly random binary circulation is even with probability exactly 1/2. This 
specific observation seems new, although it is a simple consequence of standard results on the cut and cycle 
spaces. Using this observation we give efficient algorithms to sample a uniformly random binary circulation 
in the sequential, parallel, and distributed models of computing. 

The Distributed Model. Our approach improves several known time bounds in the distributed com- 
puting model with congestion. The precise model, denoted CONGEST [PelOO, §2.3], works as follows. The 
computation takes place in the graph G — (V, E) where each vertex is a computer and each edge is a bidi- 
rectional communication link; i.e., we study the problem of having a network compute the small cuts of its 
own topology. There is no globally shared memory, only local memory at each vertex. Initially only local 
topology is known: each vertex knows its ID value, which is unique, and its neighbours' IDs. Time elapses 
in discrete rounds. In each round, every vertex performs local computations and may send one message to 
each of its neighbors, to be received at the start of the next round. The time complexity of a distributed 
algorithm is the number of rounds that elapse, and the message complexity is the total number of messages 
that are sent. 

In the CONGEST model, every message must be at most 0(log V) bits long. The model does not bound 
the memory capacity or computational power of the vertices, although our algorithms use time and space 
polynomial in \V\ at each vertex. Let V denote the diameter of (V,E), i.e. V := maxu^ev distc^ti, v). The 
message size bound, in addition to making the algorithms more practical, affects what is possible in the model, 
as the following example from [LPSP06] shows. On the one hand, if messages are allowed to be arbitrarily 
long, any graph property whatsoever can be trivially computed in T> time 2 . On the other hand, Lotker et 
al. gave a family of graphs with T> = 3, such that in CONGEST on this family, a f2(-^/|V|/ y/log \V\)-time 
lower bound holds to find the minimum spanning tree (MST). 

A distributed time complexity faster than 0(V) on some graphs is called sub-linear. Determining whether 
a task in this model can be accomplished in sub-linear time, or better yet 0(T>) time, is a fundamental 
problem. E.g. one breakthrough was a sub-linear MST algorithm [GKP98] which was later improved [KP98] 
to time complexity 0(1? 4- y/\V\ log* | V|) — here log* x is the number of times which log must be iteratively 
applied to x before obtaining a number less than 1. Our breakthroughs in this regard are 0(T>) time 
algorithms for cut pairs, cut edges, and 2-edge-connected components, and a sub-linear algorithm for 3-edge- 
connected components. 

1.1 Existing Results 

Our results apply to three common models of computation: sequential, distributed, and parallel. Abusing 
notation for readability, we sometimes abbreviate |V| to V and \E\ to E. 

Sequential. In the usual sequential (RAM) model of computing, Tarjan was the first to obtain linear- 
time (0(V + E)-time) algorithms to find all cut vertices [Tar72], cut edges [Tar72], and cut vertex-pairs (cuts 
C CV with \C\ = 2) [HT73]. These algorithms are based on depth-first search (DFS). Galil & Italiano [GI91] 
gave the first linear-time algorithm to compute all cut pairs, by reducing to the cut vertex-pair problem. 

Distributed. Here we only mention results valid in CONGEST, ignoring results with Q(n) message size 
such as one of [Cha82]. Cut Edges/ Vertices. Two early distributed algorithms for cut edges and vertices, 
in [AZ89] and [Hoh90], use DFS. The smallest time complexity of any known distributed DFS algorithm is 
Q(V); as such, the algorithms of Ahuja & Zhu and Hohberg have ft(V) time complexity. Huang [Hua89] 
gave a non-DFS-based algorithm with 0(V) time complexity. The first sub-linear distributed algorithms for 
any type of small cuts appear in [Thu97]; using an MST subroutine, Thurimella obtained time complexity 
0(V + ^/V log* V) for both cut edges and cut vertices. Cut Pairs. For cut pairs, [JM9G] gave a distributed 

1 Our convention is that <5(0) = S(V) = is an induced edge cut — so we don't in general assume <5(5) is a cut. 
2 In T> rounds each vertex broadcasts its local topology to all other vertices, then each vertex deduces the global topology 
and solves the problem with a local computation. 
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algorithm with worst-case time and message complexity 0(rt 3 ), and [Tsi06] obtained a DFS-based algorithm 
with improved time complexity 0(T> 2 + V). 

Distributed Optimality. Distributed 0(V)-time algorithms for cut edges are optimal (up to a constant 
factor) on some graphs: e.g. it is straightforward to see, even guaranteed that G is either a |T^|-cycle or a 
|V|-path, not all edges can determine if they are cut edges in less than \V\J2 — 2 rounds. One term for 
this property is existentially optimal, due to [GKP98]. However, as Thurimella's algorithm [Thu97] showed, 
there are some graphs on which Q(V) time is not asymptotically optimal. The stronger term universally 
optimal [GKP98] applies to algorithms which, on every graph, have running time within a constant factor 
of the minimum possible. 

Parallel. In the PRAM model, an optimal 0(log V)-time and 0(V + i?)-work Las Vegas algorithm for 
cut edges and vertices was obtained in [TV85] (provided that for spanning forests, recent work of [HZ01] is 
used). For cut pairs, it may be possible to combine a a 3- vertex-connectivity algorithm of [FRT93] with the 
reduction of [GI91] (and spanning forest routines from [HZ01]) to yield a time- and work-optimal EREW 
algorithm. This is mentioned as a "future application" by Halperin & Zwick. However, this approach appears 
not to have been fully analyzed and is fairly complicated. 

1.2 Our Contributions 

Since our algorithms are randomized, we differentiate between two types of algorithms: Monte Carlo ones 
have deterministically bounded running time but may be incorrect with probability at most 1/V and Las 
Vegas ones are always correct and have bounded expected running time 3 . (Note, a Las Vegas algorithm can 
always be converted to Monte Carlo, so Las Vegas is generally better). 

Sequential. The random circulation approach yields new linear-time algorithms to compute all cut edges 
and cut pairs of the Las Vegas type. As far as we are aware, our linear-time cut pair algorithm is the first 
one that does not rely on either DFS (e.g., see references in [Tsi05]) or open ear decomposition (e.g., see 
references in [FRT93]). 

Distributed. We remark that all existing distributed algorithms mentioned for finding small cuts are 
deterministic. The random circulation approach yields faster distributed algorithms for small cuts of the Las 
Vegas type. For cut edges and pairs, we obtain 0(2?)-time algorithms. Compared to the previous best time 
of 0(T> + y/Vlog* V) for cut edges, we remove the dependence on |V|. Compared to the previous best time 
of 0(T> 2 + V) for cut pairs, we obtain a quadratic speedup on every graph. For cut vertices, we obtain a 
0(T> + A/logF)-time algorithm where A is the maximum degree. Compared to the previous best time of 
0(T> + y/V log* V) for cut vertices, this is faster on graphs with A,V = O(VV). We also obtain the first sub- 
linear distributed algorithm for 3-edge-connected components, using a connected components subroutine of 
[Thu97]. In Table 1 we depict our main results and earlier work, showing both time and message complexity. 

Universal Optimality. If we assume distributed algorithms must act globally in a natural sense - 
either by initiating at a single vertex, or by reporting termination — then a il(2?)-time lower bound holds for 
the problems of finding cut edges or cut pairs, on any graph. Hence under natural conditions, our 0(2?)-time 
algorithms for cut edges and cut pairs are universally optimal. 

Parallel. In the PRAM model, we obtain a Las Vegas algorithm for cut pairs and 3-edge-connected 
components with time complexity 0{\ogV + T(E)), space complexity 0(E + S(E)), and work complexity 
0(E + W(E)), where T(n), S(n), W{n) are respectively the time, space, work complexity to sort n numbers 
of length O(logrt) bits. E.g. on the EREW PRAM, we can implement our algorithm in 0(log V) time, 0(E) 
space and O(ElagE) work using a sorting subroutine of [KRS90], or in 0(logV) time, 0(E 1+e ) space and 
0(E Vlog E) work using a subroutine of [HS02]. 

1.3 Other Related Work 

Randomized algorithms appear in other literature related to the cut and cycle spaces. For example, [BL03] 
computes the genus of an embedded graph G while "observing" part of it. They use random perturbation 

3 More generally, our algorithms can obtain error probability < 1/V C for any constant c without changing the asymptotic 
complexity. 
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Cuts Found 



Time 



Messages 



[AZ89] 

[Thu97] 

[Tsi06] 
Theorem 4.3f 
Theorem 4.10f 
Theorem 4.4f 



Vertices & Edges 
Vertices & Edges 

Pairs 

Edges 

Pairs 
Vertices 



0(V) 
0(V + VV\og* V) 
0{V + V 2 ) 

0(V) 

0(2?) 
0(V + A/ logy) 



0(E) 

0(E-(V + VV\og* V)) 
0(E + V-V) 
0(E) 
0(min{V 2 , £■£>}) 
0(E(1 + A/ log V)) 



Table 1: Comparison of our three main distributed results (denoted by f) to the best previously known 
algorithms. 



and balancing steps to compute a near- circulation on G and the dual graph of G. Their computational 
model is quite different from the one here, e.g. they allow a face to modify the values of all its incident edges 
in a single time step. 

A slow bridge-finding algorithm based on random walks is given in [PV06], which inspires this paper. 
Random sampling is a fruitful technique to quickly compute so-called minimum cycle bases of the cycle 
space, e.g. see the survey [KLM+09]. 

1.4 Organization of the Paper 

In Section 2 we define random circulations and show how to construct them efficiently. In Section 3 we show 
how random circulations yield algorithms for small cuts and give sequential implementations. In Section 
4 we precisely define the assumptions in our distributed model and give the Monte Carlo algorithms; we 
introduce a technique called fundamental cycle-cast which may be of independent interest. In Section 5 we 
discuss 2- and 3-edge-connected components. In Section 6 we give the Las Vegas analogues of our distributed 
algorithms. We give Q(T>) distributed time lower bounds under precise assumptions in Section 7. We give 
the parallel cut pair algorithm in Section 8. 

2 Preliminaries on Circulations 

The cut space and cycle space over Z in directed graphs have been studied for quite some time [BM76] . For 
our purposes it is convenient to work modulo 2; then, informally, we can deal with undirected graphs since 
+1 = —1 (mod 2). For the sake of completeness, we prove the needed results. See also [Die06] which proves 
material equivalent to Propositions 2.1, 2.2, and 2.3. 

For notational convenience we identify any subset S of E with its characteristic vector \ S & defined 
by xf = 1 f° r e € S and xf = for e ^ S. We use © to stand for vector addition modulo 2, so in accordance 
with our notational convention, for S, T C E the expression S © T denotes the symmetric difference of S 
and T. 

As mentioned earlier, <j> C E is a binary circulation if in (V, (f) every vertex has even degree; the cycle 
space of graph (V, E) is the set of all its binary circulations; 5(S) denotes the edges of G with exactly one 
end in S\ an induced edge cut is a set of the form S(S) for some S; and the family of all induced edge cuts 
is called the cut space of a graph. For v E V we use 8(v) as short for 6({v}). 

Proposition 2.1. The cut space and cycle space are vector subspaces ofE®. 

Proof. Note it suffices to show each space contains and is closed under ©. For the cut space, this holds 
since 6(0) = and S(S ®T) = S(S) © S(T). For the cycle space, clearly (V, 0) has even degree at each 
vertex; and if (V, <j>\) and (V, ^2) have even degree at each vertex, then the degree of vertex v in (V, <j>\ © cfe) 
is deg0 x (v) + deg^ 2 (v) - 2 dcg^ in<fa (v) = + - (mod 2), so (j>\ © 4> 2 is a binary circulation. □ 

Proposition 2.2. The cut space and cycle space are orthogonal. 
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Proof. We need precisely to show that for any binary circulation <p and any S C V that the dot product 
<p ■ S(S) = (mod 2), or equivalently that \<p H 8{S)\ is even. Now J2 s es ^ e S<j>( s ) = £ s eS 1^ n ^( s )l an< ^ tne 
former quantity is even since ^ is a circulation. The latter sum counts every edge of <p (1 <$(<S) once, every 
edge of <p with both ends in S twice, and every other edge zero times. Since this sum is even, \<f> n 5{S)\ is 
even. □ 

In the next proposition, we assume G is connected, and hence has a spanning tree T. We need to define 
the fundamental cuts and fundamental cycles of T. For each edge e of E\E(T), we define the fundamental 
cycle C e to be the unique cycle in T U {e}. Note cycles are binary circulations. For each edge e of T, we 
define S e to be one of the two connected components of T\e, and define the fundamental cut of e to be S(S e ) 
(note S(S e ) does not depend on which connected component we chose). 

Proposition 2.3. (a) The cut space and cycle space are orthogonal complements, (b) The cycle space has 
dimension \E\ — \V\ + 1 and the cut space has dimension \V\ — 1. (c) For any spanning tree T of G, its 
fundamental cycles form a basis of the cycle space, and its fundamental cuts form a basis of the cut space. 

Proof. We will show that the \E\ — \V\ + 1 fundamental cycles are linearly independent in the cycle space 
and the |V| — 1 fundamental cuts are linearly independent in the cut space. Basic linear algebra shows 
the sum of the dimensions of two orthogonal subspaces of Z§ ^ s &t most \E\, with equality only if they are 
orthogonal complements, thus by Proposition 2.2, Proposition 2.3(a) and (b) follow, and so does (c). We 
use the following claim. 

Claim 2.4. Let H C E and consider a family of vectors {x e } e ^H over Z% ■ If x% = 1 for all e £ H, and 
x e j = for all distinct e, / € H , then {x e } et =H is linearly independent. 

Proof. Suppose for the sake of contradiction that Q) eeH a e x e is the zero vector, where a e € {0, 1} for each 
e and not all a e are zero. Pick / such that o/ = 1, then ^2 eeH a e x e j = 1, a contradiction. □ 

Note that e G C e but for any other edge / of E\E(T), f <jL C e , so by Claim 2.4 with H = E\E{T) 
and x e = C e , these vectors are linearly independent. Note that e g S(S e ) but for any other edge / of T, 
/ ^ 8(S e ), so by Claim 2.4 with H = E(T) and x e = S(S e ), these vectors are linearly independent. This 
completes the proof of Proposition 2.3. □ 



2.1 Random Circulations 

Next we show why uniform sampling of the cycle space is useful for identifying cuts. 

Proposition 2.5. Let F C E be a set that is not an induced edge cut. If (p is a uniformly random binary 
circulation, then Pr[|_Fn <p\ is even] = 1/2. 

Proof. Since F is not in the cut space, by Proposition 2.3(a) it is not orthogonal to the cycle space, i.e. there 
is a binary circulation cf>p with li^n^l odd. Now we pair up each binary circulation if on G with the binary 
circulation ip 1 := tp © <f>p. This yields a pairing of all binary circulations on G since for all if, ip' ^ tf an d 
ip" = ip. Modulo 2, \Fnip'\ = |.Fn ip\ + \FH(Pf\ = \Fr\ip\ + 1, so in each pair, exactly one of the two binary 
circulations has even intersection with F. Thus, exactly half of all binary circulations have even intersection 
with F, which proves the result. □ 

Next we give a method for constructing binary circulations (it is an undirected version of [BM76, Ex. 
12.1.1]). Given a spanning tree T and subset ip of E\E(T), we say that <p is a completion of tp if (p is a 
binary circulation and <f> H (E\E(T)) = xp. 

Proposition 2.6. For any ip C E\E(T), ip has a unique completion (p. 
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Figure 1: Completing a binary circulation. The spanning tree T is given by thick edges. Solid edges are 
in the circulation, dotted edges will not be in the circulation, and dashed edges are undecided. Left: the 
initial value of <f> (which equals ip). Middle: we ensure a leaf vertex v has even degree. Right: repeating the 
previous step yields the completed circulation <p. 

Proof. First, we give a succinct proof sketch. By Proposition 2.3(c) the cycle space can be expressed as 
{®e£E\E(T) a eCe I £ ^2 }• For which a does this yield a completion of ip? From the observations in 
the proof of Proposition 2.3, for / e E\E(T), the coordinate of Q) ee E\E(T) a eCe at index / is just a/, hence 
the unique completion of ip is the one in which a is the indicator vector of ip, i.e. the unique completion is 
4> = © eG C e . Explicitly, for / e T, we have / G <p iff / appears in an odd number of the fundamental cycles 
{C e | e G tp}. This completes the proof, but we now give a second, algorithmic proof, which is needed later. 

For a leaf node v incident to e £ E(T), since the degree of (V, <p) at v must be even, notice that we must 
have e £ <j) if |^n<$(i;)| is odd, and e £ <p if |^n5(u)| is even. Iterating this argument on T\v yields Algorithm 
1; we will show it constructs the unique completion of ip. 



Algorithm 1 Given G,T and ip C E\E(T), construct binary circulation <p such that <p\E(T) = ip. 
1: Initialize <f> := ip, S := T >5is the subtree of T where <p is not yet defined 

2: while S has any edges, 

3: Let v be any leaf of S and e be the unique incident edge of v in S 

4: if \S(v) n <p\ is odd then <f) := (f> U {e} > Satisfy degree constraint at v 

5: Delete v from S 1 
6: Output <j) 



See Figure 1 for an illustration of Algorithm 1. Now we prove Proposition 2.6 using Algorithm 1. It is 
clear that every vertex of (V, </>) has even degree except possibly the last vertex left in S. However, by the 
handshake lemma, no graph can have exactly one vertex of odd degree, so <f> is indeed a binary circulation. 
To show uniqueness, suppose for the sake of contradiction that -0 has two distinct completions <j),<f>'- Then 
<t> © 4>' C E(T), and as such the nonempty forest 4>(&(j)' has at least one vertex of degree 1. This contradicts 
the fact that (f> © <p' is a binary circulation. □ 

We now give the method for constructing uniformly random binary circulations, illustrated in Algorithm 
2: pick a uniformly random subset of E\E(T) and then compute its completion. 



Algorithm 2 Given G and spanning tree T, output a uniformly random binary circulation. 
1: for each e in E\E(T), put e in ijj with independent probability 1/2 
2: Return the completion of "0, using Algorithm 1 



Theorem 2.7. Algorithm 2 outputs a uniformly random binary circulation. 

Proof. By Proposition 2.3(b) the cycle space contains exactly 2' £; l _ l 1/ l +1 elements. Algorithm 2 makes one 
f 2l- E H y l+ 1 choices of ip each with probability 2~l £ l + l y l~ 1 , and each distinct choice of ip leads to a distinct 
binary circulation. □ 



G 




101 010 111 

111 100 100 111 
^Oll^K 



Figure 2: Constructing a random 3-bit circulation; thick edges are tree edges and thin edges are non-tree 
edges. Left: we assign random values to the non-tree edges. Middle: we set 0(e) := ®f e g( v \\ e 4>{f) f° r a 
leaf vertex v. Right: repeating the previous step yields the completed circulation 0. 

To increase the probability of identifying a particular cut beyond 1/2, our algorithms will sample multiple 
independent random circulations. For this reason it is convenient to introduce notation that incorporates 
multiple circulations into a single object. Let Z 2 denote the set of 6-bit binary strings. For 
4>i(e) denote the ith bit of 0(e). 



: E -> Z|, let 



Definition 2.8. : E — > l\ is a b-bit circulation if for each 1 < i < b, {e | 0j(e) = 1} is a binary circulation. 

Hence, to say that is a uniformly random 6-bit circulation is the same as saying that {4>i}\ = i are 
mutually independent, uniformly random binary circulations. For brevity, we use the phrase random b-bit 
circulation to stand for "uniformly random 6-bit circulation" in the rest of the paper. Let denote the 
all-zero vector and © denote addition of vectors modulo 2. Using Proposition 2.2 and Proposition 2.5 we 
obtain the following corollary. 

Corollary 2.9. Let <fi be a random b-bit circulation and F C E. Then 



Pr 



00(e) = O 



.eeF 



1, if F is an induced edge cut; 
2~ h , otherwise. 



To generate a random 6-bit circulation, it suffices to modify Algorithms 1 and 2 slightly so as to operate 
independently on each of 6 positions at once: on Line 1 of Algorithm 2 we set 0(e) to a uniformly independent 
6-bit string, and on Line 4 of Algorithm 1 we set 0(e) := ®f e s( v )\e </>(/)■ We denote the resulting algorithm 
by Rand-6-Bit-Circ and illustrate it in Figure 2. Under the standard assumption that the machine word 
size is 0(log V), the running time of Rand-6-Bit-Circ in the sequential model of computing is 0(E [~ lo gy ] )■ 

3 Basic Algorithms 

In this section we show how to use random circulations to probabilistically determine the cut edges, cut 
pairs, and cut vertices of a graph. These are the Monte Carlo versions of the algorithms. 

3.1 Finding All Cut Edges 

We provide pseudocode in Algorithm 3 and then prove its correctness. It is based on the easy fact that e is 
a cut edge if and only if {e} is an induced edge cut, which we state without proof. 



Algorithm 3 Given a connected graph G, compute the cut edges of G. 



1: Let 6 = |~log 2 VE~\ and let be a random 6-bit circulation on G. 
2: Output all edges e for which 0(e) = 



Theorem 3.1. Algorithm 3 correctly determines the cut edges with probability at least 1 — l/V and can be 
implemented in O(E) sequential time. 
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Figure 3: A graph is shown with one cut class highlighted using dashed edges, 
edges disconnects the graph. 



Deleting any two dashed 



Proof. Using the fact above, Corollary 2.9, and a union bound, the probability of error is at most E/2 b < 1/V. 
The subroutine Rand-6-Bit-Circ as well as Line 2 of Algorithm 3, each take O(E) sequential time. □ 

3.2 Finding All Cut Pairs and Cut Classes 

Proposition 3.2, whose easy proof we omit, leads to our approach for finding cut pairs. 

Proposition 3.2 (Cut pairs are induced). Let e and f be edges that are not cut edges. Then {e, /} is a cut 
pair if and only if {e, /} is an induced edge cut. 

With Corollary 2.9 we immediately obtain the following. 

Corollary 3.3. Let e, / be two distinct edges that are not cut edges. Then Pr[0(e) = 0(/)] — 1 if {e, /} is 
a cut pair, and 2~ b otherwise. 

This yields a cute probabilistic proof of the following basic fact. 

Corollary 3.4 (Transitivity of cut pairs). // {e, /} and {/, g} are cut pairs, then so is {e, g}. 

Proof. Note that e, /, g are not cut edges. Let be a random 1-bit circulation on G. By Corollary 3.3, 
0(e) = 0(/) and </>(/) = 0(<?). So 0(e) = <j>(g) with probability 1. By Corollary 3.3, {e,g} must be a cut 
pair. □ 

Definition 3.5. A cut class is an inclusion-maximal subset K of E such that \K\ > 1 and every pair 
{e, /} £ K is a cut pair. 

We illustrate a cut class in Figure 3. Note the cut class has a natural cyclic order. 

Corollary 3.4 implies that any two distinct cut classes are disjoint. Hence, even though there may be 
many cut pairs, we can describe them all compactly by listing all cut classes of the graph. We now give our 
simple linear-time algorithm to find all cut classes, with pseudocode given in Algorithm 4. 



Algorithm 4 Given a connected graph G, compute the cut classes of G. 
1: Let b — |~log 2 (V E 2 )~\ and let be a random &-bit circulation on G 

2: for each x € Z^O} such that |{e 6 E | 0(e) = x}\ > 2, output the cut class {e € E \ 0(e) = x} 



Theorem 3.6. Algorithm 4 correctly determines the cut pairs with probability at least 1 — 1/V and can be 
implemented in O(E) sequential time. 

Proof. There are \E\ edges and the analysis in Section 3.1 shows that Pr[0(e) = 0] < l/2 b for each non-cut 
edge e. There are at most (^) pairs {e, /} of non-cut edges that are not cut pairs and Corollary 3.3 shows 
that Pr[0(e) = 0(/)] < l/2 b for each such pair. Hence, by a union bound, the total probability of error is at 
most E/2 b + (f)/2 fc < 1/V. 
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The subroutine Rand-6-Bit-Circ has time complexity O(E). It remains to implement Line 2 of Algo- 
rithm 4 in 0(E) time. To do this, we sort all edges e according to the key (j)(e) using a three-pass radix sort. 
I.e., we consider each value in Z 2 as a three-digit number in base 2 6 / 3 = 0(E) — see [CLR90, §9.3] — then 
the sort takes O(E) time. □ 

3.3 Finding All Cut Vertices 

The following characterization of cut vertices underlies our approach. 

Proposition 3.7. The cut 6(v) properly contains a nonempty induced edge cut if and only if v is a cut 
vertex. 

Proof. First, suppose v is a cut vertex. Let V\ be the vertex set of one of the connected components of 
G\{z;}- Then S(v) properly contains the nonempty induced edge cut S(Vi). 

Second, suppose v is not a cut vertex, so there is a spanning tree T' of G\{u}. Suppose S C V has 
S(S) C S(v). Without loss of generality (by complementing S if necessary) we assume u£S. Since no edges 
of T" are in S(S), S either contains all of U\{v} or none of y\{t>}. Thus either S = V in which case 5(S) is 
empty, or S = {v}, in which case S(S) is not a proper subset of S(v). □ 

Using Proposition 3.7, the essential idea in our approach to find cut vertices is to detect for each vertex 
v whether S(v) properly contains any nonempty induced edge cuts. As usual we detect induced edge cuts 
via Corollary 2.9, this time rephrasing the detection problem as one of finding linearly dependent rows of a 
matrix. Hence we need the following fact, when Z 2 is viewed as a field. 

Fact 3.8. In a matrix over Z 2 , a set C of columns is linearly dependent if and only if some nonempty subset 
of C sums to the zero column vector (mod 2). 

Our approach works as follows. Note — it does not have a very efficient sequential implementation, but 
yields an efficient distributed algorithm. We generate a random &-bit circulation <fi for some suitably large 6; 
denote the ith bit of (j)(e) by 4>i(e). Let d(v) :— \5(v)\, the degree of v. Let A denote the maximum degree. 
For each vertex v, let be a matrix with b rows indexed 1, . . . , b, and d(v) columns indexed by 5(v); 

then fill the entries of MM according to = <j>i(e). The following two complementary claims validate our 
approach. 

Claim 3.9. If v is a cut vertex then ran k(AfH) < d(v) - 2. 

Proof. Let V\ be the vertex set of one of the connected components of G\{u}. Note that 8(v) can be 
partitioned into two induced edge cuts S(Vi) and 5({v} U Vi). By Corollary 2.9 the set of columns of M'"' 
corresponding to 5(Vi) adds to zero, and by Fact 3.8 these columns are linearly dependent. Similarly, the 
remaining columns, indexed by S({v} U Vi), are linearly dependent. So has at least 2 columns that are 
linearly dependent on the others, and the result follows. □ 

Claim 3.10. Let v G V and assume that v is not a cut vertex. Let C D C 8(v). The probability that the 
columns of MM indexed by D sum to the zero vector (mod 2) is 2~ b . 

Proof. By Proposition 3.7, D is not an induced edge cut, and the result follows from Corollary 2.9. □ 

Next we show that for b = \A + 2 log 2 V~\ , it is very likely that iank(M^) < d(v) — 1 iff v is a cut vertex. 
Thus our approach, with pseudocode given in Algorithm 5, is correct with high probability. 



Algorithm 5 Given a connected graph G, compute the cut vertices of G. 
1: Let b = \A + 21og 2 V] and let <f> be a random 6-bit circulation on G 
2: for each vertex v of G, if rank(Af^) < d(v) — 1 then output v 
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Theorem 3.11. Algorithm 5 correctly determines the cut vertices with probability at least 1 — 1/V. 

Proof. Claim 3.9 shows that all cut vertices are output. Consider a vertex v that is not a cut vertex and 
let D be a subset of S(v) of size d(v) — 1. By Claim 3.10, Fact 3.8, and a union bound, the probability 
that the columns of Af^ corresponding to D are linearly dependent is at most 2 d ^~ 1 2^ b < 1/V 2 ; so with 
probability at least 1 — V~ 2 , we have rank(M^l) > \D\ — d(v) — 1 and v is not output. By another union 
bound, the probability that any vertex is misclassified by Algorithm 5 is at most V/V 2 = 1/V. □ 

4 Distributed Implementation 

Our algorithms make the following three assumptions: first, the network is synchronous; second, there is a 
distinguished leader vertex at the start of computation; third, every node begins with a unique O(logV)- 
bit ID. These assumptions are standard in the sense that they are made by the best previous distributed 
algorithms [AZ89, Thu97, Tsi06] for small cuts. Nonetheless, these assumptions can be removed at a cost if 
desired, e.g. using the synchronizer of [AP90] at a polylog(V) factor increase in complexity, Peleg's [Pcl90] 
0(2?)-time leader election algorithm, or by randomly assigning IDs in the range {1, . . . , V^ 3 } (resulting in 
additional failure probability at most ( 2 ) /V 3 due to ID collisions). 

Although only vertices can store data in the distributed model, we maintain data for each edge e (e.g., 
to represent a tree) by having both endpoints of e store the data. At the end of the algorithm, we require 
that the correct result is known locally, so each node stores a boolean variable indicating whether it is a 
cut node, and similarly for edges. To indicate cut pairs, each edge must know whether it is in any cut 
pair, and in addition we must give every cut class a distinct label. Previous work also essentially uses these 
representations . 

When stating distributed algorithms, the assumptions of a leader, synchrony, unique IDs, and O(logV)- 
bit messages are implicit. Our algorithms use a breadth-first search (BFS) tree with a root r as the basis 
for communication. One reason that BFS trees are useful is that they can be constructed quickly (e.g., see 
[PelOO, §5.1]), as follows. 

Proposition 4.1. There is a distributed algorithm to construct a BFS tree in 0(T>) time and O(E) messages. 

For a tree T, the level l(v) of v e V is the distance in T between v and r. The height h(T) of tree T 
is the maximum vertex level in T. Any BFS tree T has h(T) < T> and this is important because several 
fundamental algorithms based on passing information up or down the tree take 0(h(T)) time. The parent 
of u is denoted p(u). The level of tree edge {u,p(u)} is the level of u. 

4.1 Random Circulations and Cut Edges 

When we construct a random circulation, we require at termination that each v knows </>(e) for each e £ S(v). 

Theorem 4.2. There is a distributed algorithm to sample a random b-bit circulation in 0(T>) time and 
0(E) messages, when b = 0(logV). 

Proof. We implement Rand-6-Bit-Circ distributively. The size bound ensures that 6-bit strings can be 
sent in a message. We compute a BFS tree T, using Proposition 4.1. Then for each non-tree edge {e} 
in parallel, the endpoint with the higher ID picks a random &-bit value for </>(e) and sends it to the other 
endpoint. In the following h(T) rounds, for i = h(T) down to 1, each level-i vertex computes 0({i>,p(v)}) := 
®feSM\{v p(v)} 000 and sends tlus value to P( v )- Tne complexity is 0(D+h(T)) = 0{V) time and 0(E+E) 
messages. □ 

Theorem 4.2 yields our distributed cut edge algorithm. 

Theorem 4.3. There is a distributed algorithm to compute all cut edges with probability at least 1 — 1/V in 
0(T>) time and using 0(E) messages. 
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Proof. We implement Algorithm 3 distributively, obtaining the required correctness probability by Theorem 
3.1. For k — VE, we use Theorem 4.2 to compute a random [log 2 VE] -bit circulation in the required 
complexity bounds. Then we identify e as a cut edge if </>(e) =0. □ 



4.2 Pipelining and Cut Vertices 

Our cut vertex algorithm requires a circulation on 0(A + \ogV) bits, and in order to construct such a 
circulation quickly, we use a pipelining technique. Let 7r be a distributed algorithm in which for each edge 
e, the total number of messages sent on e by tt is bounded by some universal constant Co- The messages' 
content may be random but the message-passing schedule must be deterministic. To pipeline s instances 
of tt means to execute s instances {^i\ s i=l of tt, each one delayed by a unit time step from the previous. 
When multiple instances need to simultaneously send messages along the same edge we concatenate them, 
increasing the message sizes by a factor of at most Cq. Compared to tt, pipelining adds s — 1 to the time 
complexity and increases the message complexity by a factor of s. 

A straightforward implementation of Algorithm 5 results in our cut vertex algorithm, as follows. 

Theorem 4.4. There is a distributed algorithm to compute all cut vertices with probability at least 1 — 1/V 
in 0(T> + A/ log V) time and using 0(E(1 + A/ logV^)) messages. 

Proof. We implement Algorithm 5 distributively, obtaining probability 1/V of failure by Theorem 3.11. Let 
b = |~A + 2 log 2 V] . Theorem 4.2 gives an algorithm tt to construct a random 0(log F)-bit circulation; note 
tt sends a constant number of messages along each edge. We pipeline b/\ogV instances of tt to construct a 
random 6-bit circulation. Then, each vertex v locally computes the rank of to determine if it is a cut 
vertex. 

Since tt takes 0(T>) rounds and sends O(E) messages, and b = 0(A + \ogV), the implementation takes 
Q{V + A/ logF) time and 0(E(1 + A/log V)) messages. □ 



4.3 Fundamental Cycle-Cast (fc-cast) 

We now define a new distributed technique, needed for our cut pair algorithm. A non-tree edge is an edge 
e G E\E(T). For a spanning tree T and non-tree edge e, the unique cycle in TU{e} is called the fundamental 
cycle of T and e, and we denote it by C e . We call our new technique fundamental cycle-cast, or fc-cast for 
short, and informally it allows simultaneous processing on all fundamental cycles. Let each vertex v store 
some data d[v] of length 0(log V) bits. We assume that d[v] includes the ID, level, and parent ID of v, since 
this information can be appended to d[v } while increasing its length by at most 0(log V) bits. At the end of 
the fc-cast, each non-tree edge e will know d[u] for every vertex u in the fundamental cycle of T and e. 

Theorem 4.5. There is a distributed algorithm Fc-Cast using 0(h(T)) time and 0(mm{E ■ h(T),V 2 }) 
messages that, for each non-tree edge e, for each v € C' e , sends d[v] to both endpoints of e. 

As a subroutine, we need a tree broadcast subroutine adapted from [PelOO, §3.2]. 

Proposition 4.6. There is a distributed algorithm Tree-Broadcast using 0(h(T)) time and 0(V ■ h(T)) 
messages that sends d[v] to u for each v £ V and each descendant u of v. 

Proof. Let tt be a generic distributed algorithm that sends one message from p(v) to v at time l(v); in 
particular, tt takes 0(V) messages, 0(h(T)) time, and sends at most one message on each edge. Define 
instances {7Ti}i=o °f so that for every vertex v at level i, and for every descendant u of v, instance TTi is 
responsible for propagating d[v] to u. Each instance 7Tj sends empty messages for the first i rounds, and 
in round t > i, for each v with l(v) — i, propagates d[v] down the level-t tree edges descending from v. 
Since there are h(T) + 1 pipelined instances and tt takes 0(h(T)) time and 0(V) messages, the complexity 
follows. □ 
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Proof of Theorem 4-5. An fc-cast has two steps. First, we execute Tree-Broadcast, and as a result we 
may assume that each vertex has a list of the data of all its ancestors. 

In the second step, for each non-tree edge {v, w} in parallel, v sends its list to w and vice- versa. Note that 
each non-tree edge e can determine its fundamental cycle with T by comparing its endpoints' lists. (More 
precisely, either endpoint of e can determine such.) Each list has at most 1 + h(T) items, each of which is 
0(\ogV) bits long and can be sent in a single message, so both steps in the fc-cast take 0(h(T)) time. 

The message complexity of the second step as just described is 0(E ■ h(T)), but now we give a refinement 
that achieves 0(mm{.E • h(T),V 2 }) message complexity. The essential idea is for all u, v € V, we want 
to avoid sending d[u] to v more than once. Implement the second step of the fc-cast so that each vertex 
v sends one d[-] value per round, and in the order d[v] first, then d[p(w)], etc., with the data of the root 
last. When a vertex u receives d[x] for the second time for some x, u asks the sender to stop sending 
its list. Likewise, if u receives d[x] from multiple neighbors at the same time, u asks all but one to stop 
sending their lists. Along each edge, at most one redundant message and one stop request can be sent 
in each direction. There can only be V 2 non-redundant messages; hence the total number of messages 
sent in this step is 0(V 2 + E). Considering the tree- broadcast as well, the total message complexity is 
0(V ■ h(T) + mm{E ■ h(T), V 2 + E}) = 0{mm{E ■ h(T),V 2 }) as claimed. □ 

We can implement fc-cast in 0(h(T)) time with message complexity even smaller than mm{E ■ h(T), V 2 } 
using a nearest common ancestor labeling scheme of [AGKR02]. We only sketch the idea since the precise 
improved complexity is somewhat awkward to state (seemingly cannot be expressed in terms of parameters 
V, E, A, h(T)) and does not seem universally optimal. If uw is an edge not in T, call w a non-tree neighbour 
of u and vice- versa. The general idea behind the optimized implementation is that, while the implementation 
in Theorem 4.5 sends d[v] to each descendant of v and each non-tree neighbour of a descendant of v, we can 
actually send d[v] to a smaller subset of these nodes while meeting the definition of a fundemental cycle-cast. 

In more detail, the scheme of [AGKR02] gives each vertex an 0(log y)-bit label such that given just the 
labels of any two nodes, we can also compute the label of their nearest common ancestor (with a deterministic 
algorithm independent of T) . Alstrup et al. do not work in any specific distributed model, but their scheme 
is built out of standard primitives like the number of descendants of a given node, and as such can be 
implemented in the model we consider in 0(h(T)) time and O(E) messages. The first step of our new 
implementation is to compute these labels. Then, in unit time and 2\E\ messages, we have each node inform 
each of its neighbours of its label. 

At a high level, the labeling scheme allows the implementation to be optimized as follows. In the first 
step we send d[v] down to its descendant u only if there is some fundamental cycle containing both u and 
v; in the second step each v asks for d[-] values from its non-tree neighbours in such a way that u receives 
each d[-] value at most once, and only asks for d[v] from w if C uw contains v. Implementing these steps 
requires that nodes have some knowledge about the relative position of their neighbours in the tree, which 
is accomplished using the labels. There are some slightly complicated details in implementing the first step, 
for which a pipelined convergecast (see Proposition 4.9) suffices. 

4.4 Distributed Cut Pair Algorithm 

When computing the cut pairs, it helps if we assume that G has no cut edges, i.e. G is 2-edge-connected. 
To make this assumption without loss of generality, for our input graph G, we compute the set Ec of cut 
edges using Theorem 6.3 and then report the cut pairs of the 2-edge-connected components, which are the 
connected components of G\Eq (we elaborate in Section 5). It is straightforward to show that the cut pairs 
of G are the cut pairs of these components, that each component has no cut edge, and that no component 
has diameter greater than G. 

It is not obvious how to implement our sequential cut pair algorithm (Algorithm 4) distributively: al- 
though the cut classes are properly labeled with high probability by (f>, in order for edge e to know whether 
it belongs to any cut pair, it needs to determine if any other / has cj){e) = (j)(f), and this cannot be done 
using local information (i.e., in O(l) rounds). We use fc-cast to overcome this obstacle. The following claims 
are used to relate fundamental cycles to cut classes. (The first is fairly intuitive given Figure 3 on page 8.) 



12 



Lemma 4.7. If a cycle C and a cut class K satisfy K n C ^ then K C C. 

Proof. Suppose that e £ K C but / £ K\C. Then by Proposition 3.2, {e, /} is an induced edge cut. But 
then |{e, /} PI C\ = 1, contradicting Proposition 2.2 (the orthogonality of the cut space and cycle space). □ 

Claim 4.8. Let K be a cut class. Then K C C e for some e £ E\E(T). 

Proof. First we claim K contains at most one non-tree edge. Suppose otherwise, for the sake of contradiction, 
that K contains two non-tree edges {e, /}. Then {e, /} is a cut pair and so G\{e, /} is not connected. 
However, this contradicts the fact that G\{e, /} contains the spanning tree T. 

The definition of a cut class implies \K\ > 1, so K contains at least one tree edge e. Since e is not a cut 
edge, G\{e} is connected, and hence there is a non-tree edge / that connects the two connected components 
of T\{e}. The fundamental cycle Cf of / and T thus contains e, and by Lemma 4.7, all of K. □ 

To describe our cut pair algorithm we introduce a variant of a standard technique, the convergecast (e.g., 
see [PelOO, §4.2]). Informally, it allows each node to independently query its descendants. In this paper we 
take the convention that v is always a descendant of itself. Let Desc(v) denote the set of i>'s descendants. 
For each v £ V, and each u £ Desc(v), let w[u, v] be a variable of length O(logy) stored at u. 

Proposition 4.9. There is a distributed algorithm Converge-Cast that uses 0(h(T)) time and 0(V-h(T)) 
messages so that each v £ V determines max{w[it, v] \ u £ Desc(v)}. 

Proof. We assume some familiarity with the basic implementation of convergecast in order to gloss over some 
basic details; see [PelOO, §4.2]. We use n to represent a generic distributed algorithm that sends messages 
from leaves to the root in level-synchronized fashion. The "standard" convergecast uses tt to compute 
max{w[u,r] | u £ V} at r; in round i, for i from h(T) down to 1, every level-i node passes up the largest 
value that it knows about to its parent. A slight modification yields instances {TTj}^] of tt so that for every 
vertex v at level i, instance iti propagates max{w[w, v] \ u £ Desc(v)} to v. Since there are h(T) + 1 pipelined 
instances and tt takes 0(h(T)) time and 0(V) messages, the complexity follows. □ 

Theorem 4.10. There is a distributed algorithm to compute all cut classes with probability at least 1 — l/V 
in 0(T>) time and using 0(mm{E ■ 2?, V 2 }) messages. 

Proof. As in Algorithm 4, for b = |~log 2 (V^-E 2 )] we compute a random 6-bit circulation 4> on G, using Theorem 
4.2. Denote the following assumption by (★). 

For all edges e, /, 0(e) = (f>(f) if and only if {e, /} is a cut pair. (*) 

By the analysis in the proof of Theorem 3.6, we may assume that (*) holds without violating the required 
bound of l/V on the probability of error. 

It remains only for each edge to determine whether it is a member of any cut pair, since then (f> labels the 
cut classes. For each vertex v ^ r let d[v] := (j>({v , p(v)}) . We run Fc-Cast, and as a result, the endpoints 
of each non-tree edge e can compute the multiset $ e := {</>(/) | / £ C e }. The following claim, which follows 
immediately from Claim 4.8, lets each non-tree edge determine if it is a member of any cut pair. 

Claim 4.11. A non-tree edge e is in a cut pair if and only if 0(e) occurs multiple times in $ e . 

To deal with tree edges, for each v £ V and each u £ Desc(v), define 

v[u,v] := \{e £ 5{u)\E{T) \ {v,p(v)} £ C e k (f>({v , p(v)}) occurs > 2 times in $ e }|. 

and note that w[it, v] can be determined by u after the fc-cast. We run Converge-Cast. 
Claim 4.12. Tree edge {v,p(v)} is in a cut pair if and only if3u £ Desc(v) such that w[u,w] > 0. 

Proof. If {v,p(v)} lies in a cut pair then by Claim 4.8 there is a fundamental cycle C e containing that cut 
pair. It is easy to see that one endpoint u of e is a descendant of v and has w[it, v] > 0. □ 
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By Proposition 4.9, after the convergecast, each tree edge can use Claim 4.12 to determine if it is a member 
of any cut pair. Adding up the complexity associated with constructing a BFS tree and a random circulation, 
the fc-cast, and the converge-east, we obtain 0(D + V + V + V) time and 0(E + E + min{^P, V 2 } + VV) = 
0(wm{ET>^ V 2 }) messages, as claimed. □ 



5 Computing {2, 3}-Edge-Connected Components 

Let Ec denote the set of all cut edges, and Ecp denote the set of all edges in any cut pair. 

Definition 5.1. The 2 -edge- connected components are the connected components of G\Ec- The 3-edge- 
connected components are the connected components of G\(Ecp U Ec)- 

In the sequential model, connected components of a graph can be computed in linear time. Hence we 
immediately see that our linear-time sequential cut edge and cut pair algorithms yield linear-time algorithms 
for 2- and 3-edge-connected components. 

In the distributed model, we first discuss 2-edge-connected components. Let T denote a spanning tree 
and r its root. The desired representation is for each vertex v to store a label t(v) so that t(u) = t(v) iff 
u, v are in the same 2-edge-connected component. Observe that Ec C E(T), since if e ^ T, then G\e D T 
is connected. Furthermore, the following holds. 

Claim 5.2. If u,v are in the same 2-edge-connected component, there are no cut edges on the unique u-v 
path in T . 

Proof. Suppose such a cut edge e = {u',v'} exists, where u' is the end of e closer to u along the u-v path 
in T. Then in G\{e}, the remainder of the tree path connects u to u' and v to v' . Since u,v are in the 
same 2-edge-connected component, u and v are connected in G\{e}. Thus u' and v' are connected in G\{e}, 
contradicting the fact that e = {u' , v'} is a cut edge of G. □ 

Corollary 5.3. T\Ec is a spanning forest of the 2-edge-connected components. 

In particular, for each 2-edge-connected component H, there is a subtree T# of T\Ec spanning H . The 
idea is to label the vertices of H by the ID of the root of Tjj ■ 

Theorem 5.4. There is a distributed algorithm to compute all 2-edge-connected components with probability 
at least 1 — 1/V in 0(T>) time and using 0(E) messages. 

Proof. Note for a vertex v, where H denotes its 2-edge-connected component, v is the root of Th if and only 
if either v is the root r of T, or {v,p(v)} is a cut edge. Otherwise, v and p(v) are in the same 2-edge-connected 
component. 

First we compute the cut edges, using Theorem 4.3. Vertex r sets r(r) equal to its ID. In the following 
h(T) rounds, for i = 1 to h(T), for all level-i tree edges {v,p(v)} in parallel, vertex p(v) sends r(p(v)) to v. 
Upon receiving this message, v sets t(v) :— ID(v) if is a cut edge, and t(i>) := r(p(v)) otherwise. 

The labeling takes 0(h(T)) time and \V\ — 1 messages, and the result follows. □ 

Now we discuss 3-edge-connected components. In the distributed model, we can represent a subgraph 
(V, F) of (V, E) by using a local boolean variable for each edge. For this representation, [Thu97] gave a 
distributed connected components algorithm in 0(V + VVlog* V) time, using an MST subroutine in which 
the weight of edge e is 1 for e ^ F and for e G F. Hence we have the following corollary to our cut pair 
algorithm, Theorem 4.3. 

Corollary 5.5. There is a distributed algorithm to compute all 3-edge-connected components with probability 
at least 1 — 1/V in 0(V + ^/Vlog* V) time and using 0(E(V + \jv log* V)) messages. 
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6 Las Vegas Distributed Implementation 



In this section we describe how to turn our Monte Carlo distributed algorithms into Las Vegas algorithms, 
by giving a verifier for each one. Given the output of the Monte Carlo algorithm, the verifier determines 
whether the output is correct or not; we re-run the Monte Carlo algorithm until the output is verified correct. 
For each of our verifiers, the time complexity is no more than the time complexity of the corresponding Monte 
Carlo algorithm; this fact and the fact that our algorithms work with high probability together imply that 
the resulting Las Vegas algorithms have the same asymptotic complexity as the Monte Carlo ones. See 
[MROO, §1.2] for more details. 

Here is a high-level description of the three verifiers. The cut edge verifier works by labeling vertices 
according to their 2-edge-connected component; the cut vertex verifier works by labeling edges according to 
their blocks; the cut pair verifier works by exploiting relations between cut classes and fundamental cycles. 
All three of the verifiers rely on the fact that our Monte Carlo algorithms have one-sided error. 

6.1 Verifier for Cut Edges 

Recall that Algorithm 3 always outputs all cut edges, but may erroneously output some non-cut edges. 
Observe that a non-tree edge cannot be a cut edge; so we may assume the Monte Carlo algorithm outputs a 
set E' c such that E(T) D E' c D Ec, by having the verifier reject any output containing a non-tree edge. Here 
is the key idea: we compute the connected components of T\E' C . We only need to show how to determine 
if E' c \Ec is nonempty; this can be done using the following proposition and its converse, which follows. 

Proposition 6.1. If E' c \Ec is nonempty, there is a non-tree edge joining vertices in different connected 
components ofT\E' c . 

Proof. Let e be any element of E' c \Ec- Since e is not a cut edge, there is another edge f £ E connecting the 
two connected components of T\e. The endpoints of / lie in different connected components of T\E' C . □ 

Proposition 6.2. If E' c \Ec is empty, then the connected components of T\E' C are the 2-edge-connected 
components, and every non-tree edge has its endpoints in the same connected component ofT\E' c . 

Proof. Corollary 5.3 guarantees that the connected components of T\E' C are the 2-edge-connected compo- 
nents of G. Since each non-tree edge lies in at least one cycle (e.g. its fundamental cycle with T) its endpoints 
lie in the same 2-edge-connected component. □ 

Theorem 6.3. There is a Las Vegas distributed algorithm to compute all cut edges in 0(D) time and using 
0(E) messages, in expectation. 

Proof. We run the 0(T>)-time, 0(E)-messa.ge Monte Carlo cut edge algorithm from Theorem 4.3, and as 
remarked earlier, we know its output E' c satisfies E' c D Ec- Then we run the following verifier, terminating 
if it accepts, and restarting from scratch (i.e., re-running the Monte Carlo algorithm) as long as it rejects. 

If E' c contains a non-tree edge, we reject. Otherwise (if E' c C E(T)) we compute the connected compo- 
nents of E(T)\E' C using an implementation like that in the proof of Theorem 5.4, which takes 0(V) messages 
and 0(T>) time. If any non-tree edge has both endpoints in different components we reject, otherwise the 
verifier accepts; this can be checked in unit time and 0(E) messages. It follows from Propositions 6.1 and 

6.2 that the verifier accepts if and only if E' c — Ec- Since the probability of acceptance is f2(l), the expected 
time complexity is 0(T> + V + 1) and the expected message complexity is 0(E + V + E). □ 

6.2 Verifier for Cut Pairs 

As in Section 4.4 we assume without loss of generality in this section that G is 2-edge-connected. 

Consider the output of our Monte Carlo cut pair algorithm, Algorithm 4. The sense in which its output 
is one-sided is that every cut class is a subset of one of its output classes; the verifier must ensure that no 
cut class is "too big." To explain our approach, we define a notion of "wanting." Recall $ e , the multiset 
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{<M/) I / G Ce} defined in Section 4.4; if the value x appears more than once in $ e , say that e wants the set 
{/ € C e | </>(/) = 2;}. With high probability, the wanted sets are precisely the cut classes. First, our verifier 
checks that whenever an edge lies in two wanted sets, those sets are the same; second, we use the following 
proposition to verify that no wanted set is "too big." 

Proposition 6.4. Let T be any spanning tree and e, / be edges that are not cut edges. If {e, /} is not a cut 
pair, then some fundamental cycle of T contains exactly one of e and f. 

Proof. We prove the contrapositive; hence we assume that the characteristic vector of {e, /} has even dot 
product with every fundamental cycle. By Proposition 2.3(c) the fundamental cycles form a basis of the 
cycle space; so {e, /} is orthogonal to the cycle space, and by Proposition 2.3(a), lies in the cut space. Thus 
{e, /} is an induced edge cut, and so (by Proposition 3.2) a cut pair. □ 

In order to apply Proposition 6.4, we count the size of all wanted sets, since then each non-tree edge can 
determine if its fundamental cycle is "missing" some members. Our strategy uses a modified Converge- 
Cast (Proposition 4.9) where we interpret max as lexicographic comparison of data. We need to give each 
edge a distinct 0(logy)-bit name, e.g. by concatenating the IDs of its endpoints. When e wants S, it sends 
the ordered pair (e, \S\) towards all of S. (Concretely, for each tree edge {v,p(v)} in S, this data is sent to v.) 
If two pairs (e, k) and (e', k') such that k ^ k! are sent to the same location, the verifier rejects. Otherwise, 
each tree edge takes the label (e, k) where e is the lexicographically-maximal edge that wants it. We run 
another fc-cast with the new labels; then each non-tree edge / checks, for each distinct label (e, fc) occurring 
in Cf, that there are exactly k edges in Cf with label (e, k). The complexity of the verifier is dominated by 
the fc-cast, and we thereby obtain the following theorem. 

Theorem 6.5. There is a Las Vegas distributed algorithm to compute all cut classes in 0(T>) time and using 
0(mm{E ■ D,V 2 }) messages, in expectation. 

6.3 Verifier for Cut Vertices and Blocks 

For edges e, / in E(G), define e ~ / if either e = /, or e ^ f and there is a cycle that contains both e and 
/. It is well-known that ~ is an equivalence relation on E; its equivalence classes are called the blocks of G. 
A vertex is a cut vertex iff it is incident to more than one block. The overall strategy is to try to label the 
edges according to the blocks, and then check via a generating relation that our labeling is correct. 

The strategy for this verifier is more involved than for the other two, and a high-level description is 
as follows. Given two equivalence relations R and R' on the same set, we say that R refines R' if every 
equivalence class of R is a subset of some equivalence class of R' . Note that R refines R 1 and R 1 refines R if 
and only if R = R' . We use the notion of local blocks: 

Definition 6.6. The local blocks at v, denoted ~„, is an equivalence relation on 5(v) obtained by restricting 
~ to S(v) : namely we write e ~„ / iff e, / € S(v) and e ~ /. 

An analogue of Claim 3.10 will show that with high probability, the linear dependencies amongst columns 
of Af^ correspond to the local blocks at v. We hence compute equivalence relations ~^ on 5(v), for each v, 
with the following properties: 

• r~i' v always refines ~„ 

• we can collect the local relations into a global equivalence relation ~' on E 

• ~' always refines ~ 

• with high probability, ^' v =r ^ v for all v 

• if ~^=~u for all v, then ~'=~ 

Finally, we need to check whether ~'=~. To perform this check, we adapt an approach from work of [TV85] 
and [Thu97], exemplified in the following proposition, which we will prove in Section 6.3.3. 
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Figure 4: The interaction between a spanning tree and the blocks of a graph. Thick lines are tree edges, 
thin lines are non-tree edges, and the dashed regions indicate the five blocks of the graph. 

Proposition 6.7. In 0(T>) time and O(E) messages we can compute a relation on E so that (1) whenever 
e /} e and f meet at a vertex, and (2) the symmetric reflexive transitive closure of is ~. 

Some logical manipulation shows that ~ refines ~' if and only if 

Vw : (Vu, w adjacent to v : {u, v} ~o { v , w} {u, v} ~' {«, w}) 
and as a result, local checks complete the verification. We now give the details. 

6.3.1 Computing 

What do the local blocks look like? It is not hard to see that the local blocks at v correspond to the connected 
components of G\v, in the sense that {u, v} ~„ {u>, v} if and only if u and w are connected in G\v. It is also 
straightforward to see that F C 5(v) is an induced edge cut if and only if F is a disjoint union of equivalence 
classes of ~„. We take b = \A + 21og 2 V~\ and just as in Claim 3.10, with probability 1 - 0(1/V 2 ), the 
following "good" case holds: the minimal sets of linearly dependent columns of correspond to the parts 
of (Notice that C is a minimal set of linearly dependent columns iff C's sum is the zero vector and no 
subset of C adds to the zero vector.) This leads to a simple idea, but we need to use some finesse in order 
that the ~' v we compute from M^l always refines ~„- 

Our starting point is to compute an arbitrary partition tt of the columns of into minimal zero-sum 
sets (such a partition exists because the sum of all columns is zero). It is possible that such a partition 
docs not refine ~' v ; so we need to check an additional property of n, namely that each pair of parts of tt 
has mutually orthogonal span. (If this property does not hold, the verifier rejects and we re-start the Monte 
Carlo algorithm.) This property ensures that the only zero-sum sets of columns are unions of parts of tt, 
which in turn shows that ~ v refines tt. (Moreover, this property holds in the "good" case.) So we obtain 
from tt by replacing each column by its index in S(v). 

6.3.2 Computing ~' from ~^ 

For the rest of the section we consider the spanning tree T upon which our algorithms operate as fixed; 
hence when we say "fundamental cycle of e" we mean with respect to T. We assume T is rooted at the 
leader vertex r and we let p(v) denote the parent of v in T. In collecting the local relations into a global 
relation, it is instructive to consider the interaction between T and the blocks of the graph; Figure 4 gives 
an illustration. It is not hard to argue that the intersection of T with any given block B is a subtree of T; 
we define the root r(B) of the block to be the root of this subtree. For example, in Figure 4, r and u are 
each the root of two blocks, and w is the root of one block. In general, the blocks for which v is the root 
correspond to the equivalence classes of ~„ not containing {v,p(v)} (if v = r, all equivalence classes of 
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We now define ~'. For computational purposes, assign each equivalence class X of ~„ a number i v (X), 
using the numbers 1,2,... for each v. Then assign each block B the label (r(B),i r m\(X)) where the 
equivalence class X is the intersection of S(r(B)) with B. At a high level, to compute ~ from ~„ within 
in each block, we broadcast its label starting from the block's root. Now given instead of ~„, we can 
mimic this strategy so as to compute a global relation ~'. We give pseudocode in Algorithm 6; the phrase 
"v sets directed label (v, u) to £?' means that v stores £ as the label of {v, u} and notifies u of this fact with 
a message. 



Algorithm 6 Given local relations ~' v , compute a global relation ~' . 
1: at each vertex v, number the equivalence classes of ~' v by 1,2,... 

2: at each vertex v, for each equivalence class X of not containing {v,p(v)}, for each {v,u} £ X, set 

directed label (v,u) to (v,i v (X)) 
3: when vertex w sets directed label (w, v) to £, if the label of (v, w) exists and is not equal to £ then FAIL, 

else if directed label (v,w) is unassigned, for each {v,u} ~^ {v,w}, set directed label (v,u) to £ 
4: take the edge labels to identify the equivalence classes of ~' 



Any pair of ~'-wise related edges are connected by a path of edges related pairwise by local ~' v relations; 
since ~^ refines ~„ which is a restriction of ~, we see that ~' refines ~. When ~^=~„ for all v, the preceding 
discussion implies that ~'=~. The message complexity of Algorithm 6 is 0(E). When ^' v —^ v for all v, the 
time complexity is T> rounds; if more rounds than this elapse we restart the Las Vegas algorithm. 

6.3.3 The Generating Relation 

In order to define ~o we need a few preliminaries. Let pre(v) denote a preordering of T starting from the 
root, and for each vertex v, let desc(v) denote the number of descendants of v. Thus the set of descendants 
of v is the set of vertices with preorder labels in {pre(v), . . . , pre(v) + desc(v) — 1}. The subtree-neighbourhood 
of v is defined to be v's descendants, in addition to every other vertex that is adjacent to a descendant of v 
via a non-tree edge. For each vertex v let the values low{v) and high(v) denote the minimum and maximum 
preorder label in the subtree-neighbourhood of v. Tarjan [Tar74] introduced these low and high functions; 
they have been used in several biconnectivity algorithms [TV85, Thu97]. 

Definition 6.8. The relation {w, v} ~i {v,p(v)} holds if and only if {w, v} £ T and either pre(w) < pre(v) 
or pre(w) > pre(v) + desc(v) (i.e., if w is not a descendant of v). The relation {v,p(v)} ^2 {p( v ) , p(p( v ))} 
holds if and only if either low(v) < pre(p(v)) or high(v) > pre(p(v)) + desc(p(v)) (i.e., if the subtree- 
neighbourhood of v is not contained in the descendants of p(v)). Define ~o to be the union of ~i and 

We illustrate these relations in Figure 5. Earlier work [TV85, Thu97] uses a different generating relation 
for ~; ours is simpler and also has the crucial property that every two edges related by ^0 have a common 
vertex. 

From now on, given a relation R, let R* denote the equivalence relation obtained by taking the reflexive 
symmetric transitive closure of R. We now prove the key property of ^o- 

Proof o/^q= ~ (Proposition 6.7). First, we argue that ~o refines ~; for this it suffices to show that when 
e ~i / for i € {1, 2}, e and / lie in the same block. If {w, v} ~i {v,p(v)}, the fundamental cycle of {v, w} 
contains {v,p(v)}, so {v, w} ~ {v,p(v)} as needed. If {v,p(v)} ^2 {p(v),p(p(v))} then there is edge from a 
descendant of v to a non-descendant of p(v); the fundamental cycle of this edge contains both {v,p(v)} and 
{p(v),p(p(v))}, as needed. 

Second, we must show that ~ refines Define e ^fc / if e and / lie on a common fundamental cycle. 
Tarjan & Vishkin [TV85, Thm. 1] show that r ^*pc =r ^ ■ So it suffices to show that when e ~fc /j e ~o / 
holds. In other words, we need to show that each fundamental cycle lies in a single equivalence class of ~q. 
We provide a pictorial argument of this fact in Figure 6. □ 
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Figure 5: Schematic illustrations of the relations ~i (left) and ^2 (right). Thick edges are tree edges, thin 
edges are non-tree edges, and triangles depict sets of descendants. Dotted arrows indicate pairs of edges 
related by ~j. 




Figure 6: The fundamental cycle C e in the proof of Proposition 6.7. Edges of T are thick lines and e is 
labeled. The left diagram shows the case that one of e's endpoints is a T-descendant of the other, while the 
right diagram shows the case that e's endpoints are unrelated. Dotted arrows indicate pairs of edges related 
by ~i. 

We now recap the distributed implementation of our cut vertex verifier. 

Theorem 6.9. There is a Las Vegas distributed algorithm to compute all cut vertices in 0(T> + A/ log V) 
time and using 0(E(1 + A/log V)) messages, in expectation. 

Proof. We compute a random 6-bit circulation for b — \ A + 2 log 2 V] and use the resulting values to compute 
local relations ~' v , (As mentioned in Section 6.3.2 the verifier may reject at this stage.) We then combine 
this information into a global labeling ~' of edges (and again, the verifier may reject at this stage). 

There is a straightforward distributed protocol to compute pre(v), desc(v),low(v) and high(v) at each v 
in 0(h(T)) = 0(D) time and using O(E) messages; see e.g. [Pri05, Thu97]. After this, each vertex sends 
these four values to all of its neighbours, with communication taking place along all edges in parallel; this 
takes O(l) time and 0(E) messages. 

At this point, for each pair e, / of edges that are related by their common endpoint v checks that 
e ~' / holds. If there is a violation at any vertex, the verifier rejects, and if not, the verifier accepts. The 
labels ~' give the blocks; vertex v is a cut vertex iff at least two blocks meet at v. 

Computing <f> dominates the time and message complexity; each other step takes 0(D) time and 0(E) 
messages. Noting that the verifier accepts each time with probability at least 1 — 1/V, Theorem 6.9 follows. 

□ 

7 Lower Bounds on Distributed Time 

In this section we give precise assumptions under which our distributed cut edge and cut pair algorithms 
achieve universal optimality. Let r denote the unique leader vertex in the graph. A vertex is quiescent in 
a given round if it does not send any messages or modify its local memory in that round. We adopt the 
following terminology from [PelOO, §3.4 & Ch. 24]. 
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Definition 7.1. A distributed algorithm has termination detection if r has a local boolean variable done, 
initialized to false, so that done is set to true exactly once, in the last round of the algorithm. A distributed 
algorithm has a single initiator if, except for r, every vertex is quiescent until it receives a message. 

The state of a vertex means the contents of its memory. We omit the straightforward inductive proof of 
the following standard proposition. 

Proposition 7.2. Let two graphs both contain a vertex v and have the same graph topology and node IDs 
in the distance-d neighbourhood of v. If the same deterministic distributed algorithm is run on both graphs, 
the state of v is the same in both instances for the first d — 1 rounds. For a randomized algorithm, the 
distribution over states of v is the same. 

For a graph G, a vertex v S V(G) and an integer t > 3, we now define graphs G c and G p that implicitly 
depend on I and v. Specifically, let G c denote the graph obtained from G by attaching a £-edge cycle to G 
at v, and let G p denote the graph obtained from G by attaching a (£ — l)-edge path to G at v, as shown in 
Figure 7. Give corresponding vertices Vi in the two graphs the same ID. 

Theorem 7.3. Any deterministic distributed algorithm for finding all cut edges that has termination detec- 
tion takes at least T>/2 rounds on every graph. 

Proof. Consider for the sake of contradiction a graph G upon which the algorithm terminates in t < T>/2 
rounds. Let v be any vertex of distance at least V/2 away from r, and let £ — 2t + 2. By Proposition 7.2, 
the algorithm also sets done := true at r on G p and G c in t rounds, so the algorithms terminate then. 

Now consider V£/ 2 ; using Proposition 7.2 again, we see that its state is the same at termination in both 
instances. Since the edges incident to Vg/2 are cut edges in G p but not in G c , they must have been incorrectly 
classified at V(/2 in at least one instance. □ 

If we assume that the algorithm has a single initiator instead of assuming termination detection, a similar 
argument works. We use the following lemma, whose easy inductive proof is omitted. 

Lemma 7.4. In an algorithm with a single initiator, every vertex at distance t from r is quiescent for the 
first t rounds. 

Theorem 7.5. Any deterministic distributed algorithm for finding all cut edges that has a single initiator 
takes at least V/2 rounds on every graph. 

Proof. Suppose the algorithm terminates in t < T>/2 rounds on a graph G. Let v be any vertex of distance 
at least T>/2 away from r. Then by Proposition 7.2 the algorithm also terminates in t rounds on G^' v and 
Gp' v . By Lemma 7.4 vertex v\ is quiescent during the entire execution of the algorithm on these new graphs; 
hence the incident edges cannot be correctly classified in both instances. □ 

For randomized algorithms we have the following lower bound. 

Theorem 7.6. Any randomized distributed algorithm with error probability less than 1/4 for finding all cut 
edges takes at least T>/4 rounds in expectation, if it has a single initiator or termination confirmation. 
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Proof. We use the same setup as in the proofs of Theorems 7.3 and 7.5. Markov's inequality shows that 
when running the algorithm on G, the time of termination t satisfies Pr[t < T>/2] > 1/2. The distribution 
on the state of the crucial vertex — v^m for termination confirmation, v\ for single initiator — is the same 
on both G c and G p at time T>/2. So of the > 1/2 probability mass of termination before T>/2, either 1/4 
incorrectly classifies edges of G c as cut edges or edges of G p as not cut edges. □ 

The same lower bounds hold for finding 2-edge-connected components and cut pairs, since the new edges 
of G c are in cut pairs, while the new edges of G p are not. It is straightforward to verify that our distributed 
algorithms can be implemented so as to have a single initiator and termination detection; then their universal 
optimality follows. 

If we do not require a single initiator or termination detection, and if we change our input model to allow 
additional parameters of G to be initially known at each node, neighbourhood cover techniques of [ElkOG] 
can be synthesized with our techniques to yield even faster algorithms for certain graph classes. Elkin used 
these techniques to obtain distributed MST algorithms faster than 0(V) on some graphs. 

8 Parallel Cut Pairs on the EREW PRAM 

In this section we give a parallel cut pair algorithm of time complexity 0(log V) for the EREW PRAM. 
Computing the OR of n bits has a lower bound of 0(log n) time in this model; from this an easy combinatorial 
reduction yields an Q (log V) time lower bound for finding all cut pairs of a graph, so our algorithm is time- 
optimal. As in Section 4.4 we assume without loss of generality in this section that G is 2-edge-connected. 

We will require several common subroutines. First, we need a Las Vegas randomized spanning forest 
subroutine taking 0(V + E) work and space and 0(\ogV) time, due to [HZ01]. An ear decomposition can 
be computed in the same randomized complexity using the approaches in [MSV86, MR92] and plugging 
in the result of [HZ01] for the spanning forest subroutine. Expression evaluation of an n-node tree can be 
accomplished deterministically in 0(n) work and space and O(logn) time (e.g. see the book [JaJ92, Ch. 3]). 
We let T(n), S(n), W(n) denote the time, space, work complexity to sort n numbers of length O(logn) bits; 
we give references to the best known algorithms for this problem in Section 1.2 (they are deterministic). 
First, we give our Monte Carlo cut pair algorithm. 

Theorem 8.1. There is a parallel algorithm to compute all cut pairs with probability at least 1 — 1/V in 
0{\ogV + T{E)) time, 0(E + S(E)) space, and 0(E + W{E)) work. 

Proof. We implement Algorithm 4 distributively. First, we claim we can implement the subroutine Rand-6- 
Bit-Circ distributively to generate a random 0(logV^)-bit circulation in logarithmic time and linear work; 
the completion steps (Algorithm 1) are accomplished via a call to expression evaluation in which we compute 
the expression 0(e) := ®f e s(v)\e^(f) ^ or eacn tree edge e = {v,p(v)}. We implement Line 2 of Algorithm 
4 via a sort. □ 

8.1 Las Vegas Cut Pair Algorithm 

The verifier for our parallel cut pair algorithm works by attempting to construct the 2-cactus of G which 
acts as a certificate for all of the cut pairs. Our terminology is derived from a more general sort of cactus 
originally due to [DKL76] that acts as a certificate for all minimum edge cuts. Say that u = v in G if the 
edge-connectivity between u and v is at least 3; it is easy to show (e.g. using the max-flow min-cut theorem) 
that = is an equivalence relation. 

We now define how to contract 1 a graph by an equivalence relation. Contraction may introduce parallel 
edges and/or loops; for this reason in the rest of the paper, when we say a graph, we mean a multigraph, 
which may have parallel edges and/or loops. Given a graph G and an equivalence relation R on the vertices 
of G, the contraction denoted G/R is another (multi)graph. For each equivalence class C of R, G/R has 

4 When we "contract," we may identify non-adjacent vertices; this contrasts with the more common meaning of "contract" 
in the context of graph minors. 
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Figure 8: Left: a graph. For ease of visualization, every node is labelled by its equivalence class of =, except 
for nodes in singleton classes; every edge is coloured according to its cut class, except for edges in no cut 
pair, which are black. Right: the 2-cactus Ca(G) defined equal to G/=. 

a vertex labelled G. For each edge e of G, G/R has an edge from the vertex labelled by the equivalence 
class of u to that of v. Since the edges of G correspond bijectively to the edges of G/R, we will speak of the 
graphs as having the same edge set (alternatively, one may think of each edge of G having a distinct label 
which is inherited by the corresponding edge of G/R). 

Definition 8.2. The 2-cactus Ca(G) of G is G/ = . 

An example of a 2-cactus is given in Figure 8. 

Proposition 8.3. (a) For any equivalence relation R on V, every cut pair of G/R is a cut pair of G. (b) 
Every cut pair of G is a cut pair of Ca(G). 

Proof. Both results use a common observation. As before let S(Z) denote the set of edges with exactly one 
endpoint in Z. Let A be a set of equivalence classes of R (which we may view as a vertex set in G/R) and 
let UA be the union of those classes (which is a vertex set in G). Then from the definition of contraction it 
is easy to see the following: 

The edge set <5(UA) of G is the same as the edge set S(X) of G/R. (X) 

To prove (a), let 6(X) = {e, /} be a cut pair of G/R, where AT is a set of equivalence classes of R (vertices 
of G/R). Let UA be the union of these classes. By (J), in G the edge set <5(UA) is precisely {e, /} giving 
the needed fact that {e, /} is a cut pair of G. 

To prove (b), let 5(S) — {e, /} be a cut pair of G. By the (weak) max-flow min-cut theorem s ^ t holds 
for each s € S,t $ S. So for any s € S and for any t = s we have t € S, i.e. S is a union of some equivalence 
classes of =. Call this set of classes A (so in the earlier notation, S — UX). By (|), in G/ = (which is Ca(G)) 
we have S(X) — {e, /}, so {e, /} is a cut pair of Ca(G) as needed. □ 

Now we recall the earlier convention (from Section 2.1) of using <j> to denote a (random) 6-bit binary 
circulation, wherein for each i the sets {e | <pi(e) — 1} are independent binary circulations selected uniformly 
at random. Recall also Corollary 3.3 which (together with the assumption that G is 2-edge-connected) says 
that 4>{e) = 4>{f) always holds when {e, /} is a cut pair, and holds with probability l/2 h otherwise. Define 
an illusory cut pair to be a pair of edges {e, /} that has 4>{e) = (j){f) but is not a cut pair. Our strategy for 
the parallel algorithm will be to define another relation =' so that = and =' will agree when there are no 
illusory cut pairs; we will then use Proposition 8.3 to verify that there are no illusory cut pairs. 
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8.1.1 Cactuslike Graphs 

The relation =' must provide an alternate way of constructing Ca(G), when there are no illusory cut pairs. 
To this end we examine the properties of Ca(G) in more detail. We fix some terminology: a closed walk has 
distinct edges but may repeat vertices; a simple cycle is any closed walk without repeated vertices. Thus a 
simple cycle of length 1 is a loop, and a simple cycle of length 2 is a parallel pair of non-loop edges. 

An ear decomposition of G is a sequence of graphs Go C G\ C G2 C • • • C Gk = G such that Go is just 
a vertex and each Gi is obtained from Gj_i by attaching a simple cycle (a closed ear) or path with both 
endpoints in Gj_i (an open ear). It is well-known that a graph is 2-edge-connected if and only if it admits 
an ear decomposition. The path or cycle added to Gj_i to get Gi is called the ith ear of G, and it is denoted 
E t . 

We omit the straightforward proof of Proposition 8.4. 

Proposition 8.4. Every pair of nodes in Ca(G) has edge-connectivity equal to 2. 

Call a graph cactuslike if every pair of nodes has edge-connectivity equal to 2. 

Proposition 8.5. The following are equivalent for any graph: (a) it is cactuslike; (b) in some ear decom- 
position, all its ears are closed; (c) in every ear decomposition, all its ears are closed; (d) every edge lies in 
exactly one simple cycle. 

Proof. Trivially, (c) implies (b). It is easy to see that (b) implies (a) by induction on the ears. 

We now prove the contrapositive of (a) (d) . If (d) is false there are two simple cycles C\ , G2 both 
containing an edge e; since the Ci arc different there is another edge / with / G Gi , / ^ G 2 (WOLOG). Let 
P be the inclusion- minimal subpath of C\ containing / with both its endpoints in C2. Then we get three 
edge-disjoint paths connecting the endpoints of P: P itself plus two in G2. By the definition of cactuslike, 
(a) is false, and we are done. 

We also prove the contrapositive of (d) =>■ (c). If (c) is false there is an ear decomposition with an open 
ear Ei with endpoints u,v. Since Gj_i is 2-edge-connected, there are two different simple u-v paths in Gj_i. 
Combining these paths in turn with Ei gives two simple cycles having at least one common edge, so (d) is 
false. □ 

From Proposition 8.5(c) we obtain the following corollary by induction on the ears. 

Corollary 8.6. In a cactuslike graph, for any ear decomposition, the cut classes are the same as the nons- 
ingleton ears. 

We know from Proposition 8.4, along with the definitions of Ca(G) and cactuslikc, that Ca(G) is a 
cactuslike contraction of G, and by Proposition 8.3 the cut classes of Ca(G) are the same as the cut classes 
of G. The following converse will be very useful. 

Lemma 8.7. Let R be an equivalence relation for which G/R is cactuslike, and such that the cut classes of 
G/R and the cut classes of G are the same. Then R is the same as =. 

Proof. First, we establish that = refines R. Suppose otherwise, that there are vertices u, v related by 
but not by R. Then the vertices corresponding to the equivalence classes of u and v in G/R are different. 
Since G/R is cactuslike, there is a cut pair separating those vertices. But then (J) shows this cut pair also 
separates u from v in G, contradicting the fact that u = v. 

Now, we establish that R refines =. Suppose otherwise, that there are vertices u, v related by R but not 
by =. Since u^v there is a cut pair {e, /} in G separating u from v. Thus G\{e, /} has two connected 
components, one containing u and one containing v, and since uRv, we see G / R\{e, f} is connected. So G 
and G/R have different cut pairs, hence different cut classes, which is the needed contradiction. □ 
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8.1.2 Pinching Ears 

In this subsection we develop an algorithmic ear-based tool to construct =. First we need the following. 

Lemma 8.8. Every cut class of G lies within a single ear of any ear decomposition of G. 

Proof. Suppose otherwise, that there is a cut pair {e, /} with e € -Ej and / € Gj_i. Since Gj_i is 2-edge- 
connected, Gj_i\/ is connected. But Gj\{e, /} is obtained by attaching 2 paths to Gj_i, and so is connected. 
By induction on the remaining ears we see that each Gj\{e, /} for j > i is connected; in particular for j = k 
this means G\{e, /} is connected, a contradiction. □ 

The cut classes of Ca(G) and G agree (Proposition 8.3), and by Corollary 8.6 the cut classes of Ca(G) 
are the same as its non-loop simple cycles. We reiterate for future reference: 

The cut classes of G are the same as the non-loop simple cycles in Ca(G). ((}) 

We now define a pinching operation whose effect is also to turn a cut class into a cycle. Let a given ear 
Ei have vertices and edges vq, e±, v%, ea, i>2) • • • , e z , v z in that order, where vq = v z iff the ear is closed. For a 
given subset U — {e^-m, ej(-z), ■ ■ ■ , e j(t)} of the ear's edges indexed such that j(l) < j{2) < • • • < j(t), let Ru 
denote the equivalence relation on the ear's vertices consisting exactly of the pairs 

Ru ■= {{^(l),%(2)-l}J^(2),Wj(3)-l},---,{%(t-l),^(t)-l},{^(i),^(l)-l}}. 

Thus in the "pinched" ear Ei/Ru, the set U forms a simple cycle. For example if Ru consists of just one 
edge, then that edge becomes a loop in Ei/Ru ■ 

Definition 8.9. Denote the cut classes contained in ear i as U[l], U[2], . . . , U[s], then define =, to be the 
equivalence relation defined by the transitive closure (-Rmil U Ru[2] U ••• U Ru\ s ])*- 

In other words, =t simultaneously pinches all cut classes appearing in ear Ei. 

Lemma 8.10. For each i, =j refines =. 

Proof. Using Lemma 8.8 and the preceding definitions, it is necessary and sufficient to show that for each 
cut class U, every pair of vertices related by Ru is also related by =. 

It is not hard to see that we may use U to decompose G into \U\ 2-edge-connected graphs linked in 
a cycle by edges of U. We illustrate this on the left side of Figure 9; compare with Figure 3 on page 8. 
Moreover, it is clear that the order of appearance of U in Ei is the same as the cyclic order of U in this 
decomposition. Therefore, adopting the notation in the definition of Ru, we label the 2-edge-connected 
graphs Gi, G2, ■ ■ ■ , G\u\ such that edge e.j( x ) joins G x _i to G x (or if x = 1, G\u\ to Gi). The notation is 
illustrated on the right side of Figure 9. 

With this setup, we now show that every pair of edges related by Rjj is also related by =. First, 
Vjt x \ = Vjf x+ u_i for 1 < x < \U\ since these two vertices are 2-edge-connected in G x and also are linked by 
a third disjoint path going the long way around the cycle (using all of U). Establishing that VjM = 
is similar. □ 

Proposition 8.11. The equivalence relation (Uj =i)* is the same as =. 

Proof. This is where we make use of Lemma 8.7. The main part of the proof is to show that the simple 
cycles in G/(U; =j)* are the same as the simple cycles in G/= (which by (^>) are the same as G's cut 
classes). Supposing we can prove this, it follows by two applications of Proposition 8.5(d) that G/(Uj =i)* 
is cactuslike, then Corollary 8.6 shows that G/(U, =,)* has the same cut classes as G, then applying Lemma 
8.7 we are done. 

Using Lemma 8.8 any cut class U of G becomes a simple cycle in G/Ru and hence a closed walk in 
G/(Ui =i)* (because that graph is a contraction of G/ Ru). Moreover, we know by (<0>) that every cut class 
of G is a simple cycle of G/=, and since (Uj =j)* refines =, it follows that every cut class of G is becomes a 
simple cycle in G/(U, =»)*. Reiterating, every cut class of G is a simple cycle of G/(Ui =j)*. 
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Figure 9: Left: the structure of a 2-edge-connected graph with respect to a cut class (shown with a cut class 
of size 4); each hatched region indicates a 2-edge-connected subgraph (possibly a single vertex). Right: the 
notation used in the proof of Lemma 8.10. 

Could there be any other simple cycles in G/(Uj =i)* — one which is not just a cut class of G? We will 
show the answer is no, which will complete the proof. Suppose that G is any simple cycle in G/(Uj =,)*. 
Since (Uj =j)* refines = , we may view G/= as a contraction of G/(Uj =i)*, thus the image of G in G/= is a 
closed walk. But any closed walk is an edge-disjoint union of simple cycles (this is just a decomposition of 
an Eulerian graph), and the simple cycles in G/= are the cut classes of G; so G is a union of cut classes of 
G. But the previous paragraph establishes each cut class of G becomes a simple cycle in G/ (lit =i)*; and in 
G/(Uj =i)* it is impossible for G to be simultaneously a single simple cycle and a union of more than one 
simple cycle. Thus G is just a single cut class of G, as needed. □ 

8.1.3 Detecting Errors 

In the algorithm we are designing, we don't know the cut pairs; rather, we have computed <j) and know that 
with high probability, cj> labels edges by their cut class. We compute the following instead. 

Definition 8.12. For the «th ear Ei, enumerate {0(e) | e £ Ei} as {xi,X2, ■ ■ ■ ,x s }. Let W[k) denote the 
set {e G Ei | 4>{e) = Xk} and define to be the equivalence relation defined by the transitive closure 
(Rw[t] U Rw[2] U • ■ • U R W [s})* ■ 

In other words, =j simultaneously pinches all sets of common 0- value in ear Ei. 

Note first that if there are no illusory cut pairs, then the sets of common 0- value are the same as the cut 
classes, and so =[ is the same as =j. Define the equivalence relation =' to be equal to (U; =^j* ■ 

Theorem 8.13. There is a Las Vegas parallel algorithm to compute all cut pairs in O (log V + T(E)) time, 
0{E + S{E)) space, and 0(E + W(E)) work, in expectation. 

Proof. Our algorithm computes H := G/= and tries to verify all cut pairs. To compute the relations Rw[i] 
on each ear, we sort the edges on that ear lexicographically according to the pair (<f)(e),pos(e)) where pos{e) 
is the position along the ear. Then to compute the transitive closure =' of their union, we build an auxilliary 
graph on vertex set V and draw an edge for each pair of vertices that is related by some Rwu] on some 
ear; then the equivalence classes of =' are the connected components of this auxilliary graph. This can be 
done using the connected components routine of [HZ96]. From this, computing the contracted multigraph 
H takes constant time and linear work. 

Now we check if H is cactuslike, by computing an ear decomposition and seeing if all ears are closed. 
First, if H is not cactuslike, by Proposition 8.11, the verifier can reject since there is an illusory cut pair 
(since =^='). Second, if H is cactuslike, then its cut classes are the same as the ears. Using this fact, and 
sorting all edges by their <f> value, the verifier accepts iff every pair {e, /} with <j)(e) = 4>(f) is a cut pair of H. 
By Proposition 8.3(a), the verifier will reject when there is an illusory cut pair, and accept otherwise. □ 
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9 Future Work 



At the most basic level, it would be interesting to push further and find efficient algorithms for higher types 
of connectivity, such as finding all 3-edge-cuts in 0(E) sequential time or 0(D) distributed time. The state 
of the art for this problem in the sequential model is 0(V 2 ) time [GI91, KR91]. It would also be interesting 
to reduce the complexity of our parallel cut pairs algorithm to linear work and logarithmic time; it seems 
plausible that another approach would avoid radix sort. 

It is possible to deterministically compute the cut edges in the distributed model using OiV) time and 
0(E) messages, as was shown in the thesis [Pri05]. (The approach is based on the observation that {v,p(v)} 
is a cut edge if and only if low(v) > v and high(v) < v + desc(v).) However, we do not know of any 
deterministic analogues of our distributed cut pair or cut vertex algorithms. 

It would be interesting to know if our distributed cut vertex algorithm could be synthesized with the 
cut vertex algorithm of [Thu97] to yield further improvement. Alternatively, a lower bound showing that no 
0(2?)-time algorithm is possible for finding cut vertices would be very interesting. 
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